PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Sopen10g030300.2
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; asterids; lamiids; Solanales; Solanaceae; Solanoideae; Solaneae; Solanum; Lycopersicon
Family bHLH
Protein Properties Length: 1417 aa    MW: 157673 Da    pI: 6.0305
Description bHLH family protein
Gene Model
Gene Model ID | Type | Source | Coding Sequence
Sopen10g030300.2 | genome | spenn | View CDS
Signature Domain
Signature Domain
No. | Domain | Score | E-value | Start | End | HMM Start | HMM End
1 | HLH | 41.8 | 2e-13 | 71 | 123 | 1 | 55
                       CHHHHHHHHHHHHHHHHHHHHHHHCTSCC.C...TTS-STCHHHHHHHHHHHHHHH CS
               HLH   1 rrrahnerErrRRdriNsafeeLrellPk.askapskKlsKaeiLekAveYIksLq 55 
                       ++ +hn+ Er RR++iN  ++ Lr+llP+ +   + kKls  +++ +  +YI +Lq
  Sopen10g030300.2  71 KKLNHNASERDRRKKINGLYSSLRSLLPPsD---HTKKLSIPSTVSRILKYIPELQ 123
                       6789*************************44...6666***************998 PP
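
The Start and End columns of the signature-domain table can be checked against the alignment. The sketch below assumes the coordinates are 1-based and inclusive (an assumption, though it is consistent with the 71 and 123 printed beside the query row). The names `subsequence` and `ungap` are illustrative helpers, not part of any PlantTFDB tool. `SEQ_PREFIX` holds only the first 130 residues of the protein sequence listed in this record.

```python
# First 130 residues of the protein sequence listed in this record.
SEQ_PREFIX = (
    "MLAISSSSSPLFSTTTNNFGWLLEDLISHELTNSGETSNSSQKSLQHCDSNKFDQIIING"
    "GDQYQPDQTVKKLNHNASERDRRKKINGLYSSLRSLLPPSDHTKKLSIPSTVSRILKYIP"
    "ELQSEVERLV"
)

# Query row of the HLH alignment, with gap characters and the
# lowercase residue exactly as printed.
ALIGNED_QUERY = "KKLNHNASERDRRKKINGLYSSLRSLLPPsD---HTKKLSIPSTVSRILKYIPELQ"


def subsequence(seq: str, start: int, end: int) -> str:
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(seq):
        raise ValueError(f"coordinates {start}-{end} outside 1-{len(seq)}")
    return seq[start - 1:end]


def ungap(aligned: str) -> str:
    """Drop alignment gaps and normalise case so rows can be compared."""
    return aligned.replace("-", "").upper()


domain = subsequence(SEQ_PREFIX, 71, 123)
matches_alignment = domain == ungap(ALIGNED_QUERY)
print(domain, len(domain), matches_alignment)
```

If the table used 0-based or half-open coordinates, the slice would be shifted by one residue and the comparison with the ungapped query row would fail, which makes this a quick sanity check on the convention.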

Protein Features
3D Structure
Database | Entry ID | E-value | Start | End | InterPro ID | Description
PROSITE profile | PS50888 | 14.335 | 70 | 122 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
SuperFamily | SSF47459 | 2.49E-14 | 70 | 140 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
CDD | cd00083 | 1.55E-11 | 71 | 127 | No hit | No description
Gene3D | G3DSA:4.10.280.10 | 1.1E-11 | 71 | 137 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
Pfam | PF00010 | 9.5E-11 | 71 | 123 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
SMART | SM00353 | 2.8E-10 | 76 | 128 | IPR011598 | Myc-type, basic helix-loop-helix (bHLH) domain
Gene3D | G3DSA:1.25.10.10 | 2.7E-35 | 212 | 664 | IPR011989 | Armadillo-like helical
SuperFamily | SSF48371 | 3.06E-38 | 218 | 650 | IPR016024 | Armadillo-type fold
CDD | cd00020 | 1.77E-7 | 414 | 529 | No hit | No description
CDD | cd00020 | 0.00391 | 537 | 652 | No hit | No description
Gene3D | G3DSA:3.60.15.10 | 1.9E-40 | 681 | 962 | IPR001279 | Metallo-beta-lactamase
SuperFamily | SSF56281 | 1.19E-16 | 682 | 908 | IPR001279 | Metallo-beta-lactamase
Pfam | PF13691 | 8.4E-14 | 688 | 743 | IPR027794 | tRNase Z endonuclease
Gene3D | G3DSA:3.60.15.10 | 1.5E-73 | 1125 | 1403 | IPR001279 | Metallo-beta-lactamase
SuperFamily | SSF56281 | 5.77E-47 | 1125 | 1244 | IPR001279 | Metallo-beta-lactamase
Pfam | PF12706 | 1.3E-10 | 1155 | 1395 | IPR001279 | Metallo-beta-lactamase
SuperFamily | SSF56281 | 5.77E-47 | 1291 | 1404 | IPR001279 | Metallo-beta-lactamase
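
Several databases report overlapping hits for the same stretch of the protein. One way to summarise them is to merge the overlapping Start/End ranges into contiguous regions. The sketch below is an illustration only: the `(start, end)` pairs are copied from the Protein Features table, `merge_intervals` is a hypothetical helper, and treating any overlap as "the same region" is an assumption, not something the database states.

```python
from typing import List, Tuple

Interval = Tuple[int, int]

# (start, end) pairs copied from the Protein Features table.
FEATURES: List[Interval] = [
    (70, 122), (70, 140), (71, 127), (71, 137), (71, 123), (76, 128),
    (212, 664), (218, 650), (414, 529), (537, 652),
    (681, 962), (682, 908), (688, 743),
    (1125, 1403), (1125, 1244), (1155, 1395), (1291, 1404),
]


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge inclusive intervals that overlap into contiguous regions."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the current region: extend its end if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            # Starts after the current region ends: open a new region.
            merged.append((start, end))
    return merged


regions = merge_intervals(FEATURES)
for start, end in regions:
    print(f"{start}-{end}")
```

Under these assumptions the hits collapse into four regions: the bHLH hits near the N-terminus, the Armadillo-type hits, and two separate blocks of metallo-beta-lactamase hits.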
Gene Ontology
GO Term | GO Category | GO Description
GO:0042779 | Biological Process | tRNA 3'-trailer cleavage
GO:0016891 | Molecular Function | endoribonuclease activity, producing 5'-phosphomonoesters
GO:0046983 | Molecular Function | protein dimerization activity
Sequence
Protein Sequence    Length: 1417 aa
MLAISSSSSP LFSTTTNNFG WLLEDLISHE LTNSGETSNS SQKSLQHCDS NKFDQIIING  60
GDQYQPDQTV KKLNHNASER DRRKKINGLY SSLRSLLPPS DHTKKLSIPS TVSRILKYIP  120
ELQSEVERLV QKKEEFTSKN IFNKQKRIKG GIGNSSFVIS TSELGDKEIV IQISTLKINK  180
GSISEAISQL EDEGLVLLNA TSFETFEDRV FYTLHFQVEE SMRQAAETLR ELVNCGTMAQ  240
NQELLRRGVV TIFLQMLNDE EVIDLDDEEW LDDLLIVEAV KTLALLSTKV NVRGDLVTAF  300
VEQNQCRRLM LLFRLVIVEP VVSDLYLLGG QYKAVEILGN FARRSRRLRD LVIDSGEDLI  360
SLFRVLRSLA DDSIVAFDAS PAISAARTIG YLCFGSPPPP FDKLRPSLPI LRFLINQLEP  420
HIGERACPVI KHACLIVACL ARGGFDPINA LIDENICPIL VMLLAHPHSE VVASVLKVVE  480
NFLKNGTENQ IQVLHDNQVL QHVLEIVMNH DNLPPLHLRS VCRAIANIVN YWSSQIQRMV  540
DAGIFPSIIQ ISINQEVGDT KYEAIYAISS VVTRGSHEQI RHLVDHGSIA AICQGLLCED  600
YTRRASCFQA LRGILRVGEA HKVDGVNIYT QMITENGGLA KIKSQRDDRD VGEIARRLLS  660
SYWPGEKLME RNKDQNTQAY VQILGTGMDT QETSPSVLLC FDHERFIFNA GEGLQRFCTE  720
YKIKLSQVDH ICLTRVCSET TGGLPGLLLT LAGIKNGSSE SDDHVRIWGP PNLDLLVNAM  780
KTYVPHAVMT KKNIIPQSGS ALAPPLYVEE LRDVDKFKAV NISAFLLSPT QFSPNDTSIV  840
YICKLHDIRG KVDIVKAKAC GLEDKRKLGQ LQKGISVKSD LLDIEVHPDD VIGPPIPGPI  900
VLIVDCPTEP HAQELLSAQA LDAYYSDSQS NFTNVVNCII HLSPATVVNS PVYEKWMRKF  960
DSAQHIMGRA TRKHETTPIL ASSARIATRL HYLCPQFFPD PSFPSVQNDD DDVAAPNIKV  1020
PVESSVCGIS AENLLKFALR PPRKLGLDRS CVQNTMTSSV FIEELLSEIP EIAVAAKNIR  1080
KFWHKPEEDE VELSDRQDSN DVVIEEPSKF SVPKCLENVQ RDDLEIVFLG TGSSIPSKYR  1140
NVSSIYVNLF SKGGLLLDCG EGTLAQLKRR YGISGADTVV RNLRCIWISH IHADHHAGLA  1200
RILALRRDLL KGVEHEPILV VGPEKVGEFL KEYIKLEDLD MLFLDCWSTT RSKWDNTEAE  1260
DNSSQPCSKK LKPSTPLDDI TLLKCLRKVL GEAGLMRLIS FPVVHCDDAF GVVLESADRM  1320
NYGEVVPGWK VVYSGDTRPC SEVIDASLGA TILIHEATFE DGLVEEAIAR NHSTIKEALE  1380
VGDSAGAYRV ILTHFSQRYP KVPALVEVAR AGLVSCR
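
The sequence block is printed in groups of ten residues with a running position counter at the end of each full line. A minimal sketch of turning that layout into a plain residue string follows. `parse_sequence_block` is a hypothetical helper name, and the example uses only the first two lines of the block.

```python
import re


def parse_sequence_block(text: str) -> str:
    """Join a block of 10-residue groups into one sequence string.

    Everything that is not a letter is removed, which drops the spaces
    between groups, the line breaks, and the trailing position counters.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


# First two lines of the protein sequence block in this record.
BLOCK = """\
MLAISSSSSP LFSTTTNNFG WLLEDLISHE LTNSGETSNS SQKSLQHCDS NKFDQIIING  60
GDQYQPDQTV KKLNHNASER DRRKKINGLY SSLRSLLPPS DHTKKLSIPS TVSRILKYIP  120
"""

sequence = parse_sequence_block(BLOCK)
print(len(sequence))
```

Applied to the complete block, the length of the result can be compared with the 1417 aa stated in the record as a check that no line was lost in copying.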
3D Structure
Structure
PDB ID | E-value | Query Start | Query End | Hit Start | Hit End | Description
4bpl_A | 2e-44 | 333 | 667 | 105 | 425 | IMPORTIN SUBUNIT ALPHA-1A
4bqk_B | 2e-44 | 333 | 667 | 107 | 427 | IMPORTIN SUBUNIT ALPHA-1A
4bqk_A | 2e-44 | 333 | 667 | 107 | 427 | IMPORTIN SUBUNIT ALPHA-1A
4b8p_B | 2e-44 | 333 | 667 | 141 | 461 | IMPORTIN SUBUNIT ALPHA-1A
4b8p_A | 2e-44 | 333 | 667 | 141 | 461 | IMPORTIN SUBUNIT ALPHA-1A
4b8o_A | 2e-44 | 333 | 667 | 141 | 461 | IMPORTIN SUBUNIT ALPHA-1A
2yns_B | 2e-44 | 333 | 667 | 141 | 461 | IMPORTIN SUBUNIT ALPHA-1A
2yns_A | 2e-44 | 333 | 667 | 141 | 461 | IMPORTIN SUBUNIT ALPHA-1A
Regulation -- PlantRegMap
Source | Upstream Regulator | Target Gene
PlantRegMap | Retrieve | -
Annotation -- Nucleotide
Source | Hit ID | E-value | Description
GenBank | HG975449 | 0.0 | HG975449.1 Solanum pennellii chromosome ch10, complete genome.
Annotation -- Protein
Source | Hit ID | E-value | Description
Refseq | XP_015055441.1 | 0.0 | PREDICTED: zinc phosphodiesterase ELAC protein 2-like
TrEMBL | K4D2K8 | 0.0 | K4D2K8_SOLLC; Uncharacterized protein
STRING | Solyc10g079670.1.1 | 0.0 | (Solanum lycopersicum)
Best hit in Arabidopsis thaliana
Hit ID | E-value | Description
AT3G56970.1 | 2e-35 | bHLH family protein